|
|
Accession Number |
TCMCG075C07772 |
gbkey |
CDS |
Protein Id |
XP_007045612.1 |
Location |
join(38401865..38402085,38402180..38402297,38402465..38402653,38402740..38403162,38403266..38403367,38403478..38403678,38403779..38403953,38404039..38404310,38405050..38405145,38405244..38405609) |
Gene |
LOC18610083 |
GeneID |
18610083 |
Organism |
Theobroma cacao |
|
|
Length |
720aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_007045550.2
|
Definition |
PREDICTED: homeobox-leucine zipper protein HDG2 isoform X4 [Theobroma cacao] |
CDS: ATGTTCCAGCCTAACATGATGGAAGGTCAACTCCACCCTCTCGAGATGACCCAAAACACATCCGAAAGCGAGATTGCTCGAATGAGAGACGAGGAATTCGACAGTACAACCAAATCCGGTAGCGAGAACCATGAAGGTGCCTCTGGGGATGATCAAGACCCTCGTCCCAAAAAGAAGCGCTACCATCGCCATACCCAGCATCAGATCCACGAAATGGAGGCATTTTTCAAAGAGTGTCCGCACCCAGATGACAAGCAAAGGAAAGAACTTGGGCGTGAGTTAGGGTTAGAGCCATTGCAAGTGAAATTTTGGTTCCAAAACAAGCGCACCCAAATGAAGACCCAGCATGAGCGCCAAGAGAACACACAGCTTCGTACCGAGAACGAAAAGCTAAGGGCTGACAACATGAGGTTCAGGGAAGCTCTGAGCACTGCCTCATGCCCAAATTGTGGAGGTCCAACTGCTGTAGGACAAATGTCCTTCGATGAACACCATCTCAGACTTGAAAATGCTCGATTAAGAGAAGAGATTGATCGCATATCAGCAATAGCTGCAAAGTATGTTGGCAAGCCAGTGGTGAACTATCCTCTTCTTTCGTCTCCTATGCCTCCTCGACCACTTGATTTCGGTGCACAACCTGGAACTGGGGAGATGTACGGTGCTGGGGATCTTCTTAGGTCGATCAGTGCGCCTAGTGAGGCTGATAAGCCCATGATTATTGAGCTTGCGGTTGCGGCTATGGAGGAACTAATCAGAATGGCTCAGATGGGTGAACCTTTATGGATGACCAGCCTTGATGGCACAACCTCGATGCTGAATGAGGAAGAGTACATTAGGACGTTTCCTAGAGGAATTGGGCCAAAACCTACTGGCTTCAAATGCGAAGCTTCAAGAGAAACTGCTGTTGTTATCATGAACCACATTAACCTTGTCGAGATTCTCATGGATGTGCACCAATGGTCAACTGTGTTCTCGGGAATAGTTTCGAAGGCTTCGACTTTGGATGTCTTGTCAACAGGGGTAGCAGGGAATTATAATGGAGCCCTGCAAGTGATGACAGCTGAATTTCAAGTTCCTTCACCTCTTGTTCCAACTCGTGAGAGTTATTACGTAAGATATTGCAAACAGCATGCAGAGGGAACTTGGGCTGTGGTTGATGTTTCCTTGGATAACTTACGCCCTAGTCCAACAGTGAGATGCCGAAGAAGGCCATCGGGGTGCTTAATTCAAGAAATGCCCAATGGGTACTCAAAGGTTACATGGGTTGAGCATGTAGAAGTTGATGATAGAGGTGTTCACAATCTGTACAAGCAGCTGGTAAGCTCCGGCCATGCTTTCGGAGCAAAACGTTGGATTGCTACTTTAGATCGACAGTGTGAGAGGCTTGCAAGTGTTATGGCTACTAACATTCCCACTGGTGATGTTGGGGTCATAACAAATCAAGATGGGAGAAAGAGTATGCTGAAGCTAGCTGAGCGGATGGTAATAAGTTTCTGCGCAGGAGTGAGTGCCTCTACTGCTCACACATGGACTACATTATCAGGAACTGGGGCTGATGATGTTAGGGTCATGACTAGAAAGAGTGTTGATGATCCAGGCAGACCTCCTGGCATTGTGCTAAGCGCTGCAACTTCCTTCTGGCTTCCTGTTTCACCCAAGAGGGTATTTGATTTCCTCCGAGATGAGAATTCTCGAAGTGAGTGGGATATTCTTTCTAATGGTGGAGTTGTCCAAGAAATGGCACACATTGCTAATGGTCGGGATACAGGCAATTGTGTTTCACTACTTCGGGTAAATAGTGCAAATTCAAGCCAGAGCAACATGCTGATTTTACAAGAGAGTTGCGCTGATCCAACAGCCTCTTTCGTAATCTATGCTCCTGTCGATATTGTTGCAATGAATGTAGTGCTAAATGGAGGGGATCCGGACTACGTGGCCCTTCTTCCCTCAGGCTTTGCTATTCTGCCCGATGGAACTACAGCAAGTGCAGGTGGCATTGGTGATGCCGGCTCTGCTGGTTCTCTTCTGACTGTTGCATTTCAGATTTTGGTCGACTCTGTTCCTACTGCAAAACTTTCTCTTGGATCGGTTGCAACAGTTAACAATTTGATTGCATGCACTGTTGAAAGGATAAAGGCTTCACTGTCATGCGAGAATGCATGA |
Protein: MFQPNMMEGQLHPLEMTQNTSESEIARMRDEEFDSTTKSGSENHEGASGDDQDPRPKKKRYHRHTQHQIHEMEAFFKECPHPDDKQRKELGRELGLEPLQVKFWFQNKRTQMKTQHERQENTQLRTENEKLRADNMRFREALSTASCPNCGGPTAVGQMSFDEHHLRLENARLREEIDRISAIAAKYVGKPVVNYPLLSSPMPPRPLDFGAQPGTGEMYGAGDLLRSISAPSEADKPMIIELAVAAMEELIRMAQMGEPLWMTSLDGTTSMLNEEEYIRTFPRGIGPKPTGFKCEASRETAVVIMNHINLVEILMDVHQWSTVFSGIVSKASTLDVLSTGVAGNYNGALQVMTAEFQVPSPLVPTRESYYVRYCKQHAEGTWAVVDVSLDNLRPSPTVRCRRRPSGCLIQEMPNGYSKVTWVEHVEVDDRGVHNLYKQLVSSGHAFGAKRWIATLDRQCERLASVMATNIPTGDVGVITNQDGRKSMLKLAERMVISFCAGVSASTAHTWTTLSGTGADDVRVMTRKSVDDPGRPPGIVLSAATSFWLPVSPKRVFDFLRDENSRSEWDILSNGGVVQEMAHIANGRDTGNCVSLLRVNSANSSQSNMLILQESCADPTASFVIYAPVDIVAMNVVLNGGDPDYVALLPSGFAILPDGTTASAGGIGDAGSAGSLLTVAFQILVDSVPTAKLSLGSVATVNNLIACTVERIKASLSCENA |